Description

Determining a cause of a reduced performance of service in a 
communications network 

The present invention relates to a method for detecting a cause of a
reduced performance of service in a communications network, as well
as a system for controlling and monitoring a communications network,
and a control program for a device to monitor service quality and/or
faults.

Network operators in the telecommunications market offer numerous
services to customers, particularly Internet services. The services
which network operators offer are divided into different service
quality levels, in order to satisfy customer requirements according
to guaranteed service levels and to stand out from competitors. For
example, low-cost services are on offer for private customers, and
services with enhanced features are on offer at higher cost for
business customers. Such enhanced service features may for instance
include guaranteeing a high level of availability and low data loss.
For this purpose network operators and their customers enter into
service level agreements (SLA) defining the scope and quality of
services to be provided to customers. Particular significance is
then attached to proving that the service level agreement with a
given customer has been maintained. Providing such proof requires
services to be monitored, for example in order to record faults and
determine the availability of a service. Measurements are also
carried out in order to record service quality parameters such as
data loss. The provision of services is therefore not simply
confined to the administration of service parameters such as the
bandwidth or call number, but also includes controlling additional,
service-specific functions.

To ensure that the effects of failing to maintain a service level
agreement are kept to a minimum for both the network operator and
the customer, reliable and rapid detection of reduced service
performance is indispensable.

The object of the present invention is to create an efficient and
reliable method for detecting a cause of a reduced performance of
service, as well as to specify a suitable implementation of the
method, along with a communications network control and monitoring
system suitable for carrying out the method.

This object is achieved according to the invention by a method with
the features specified in Claim 1, a communications network control
and monitoring system with the features specified in Claim 8 and a
control program with the features specified in Claim 9. Advantageous
embodiments of the method according to the invention are specified
in the dependent Claims.

According to the invention, an essential precondition for efficiently
and reliably detecting a cause of a reduced performance of service
consists in first of all testing selected services for service
quality and/or availability to determine any services which are
affected by a reduced service performance. The test of service
quality and/or availability can be adapted to requirements placed by
users of a communications network on the services they use. By this
means test results are as far as possible representative of the
service quality and/or availability perceived by the users. As soon
as the services affected by a reduced service performance are
determined, network elements which are relevant to the provision of
these services according to information stored in a network element
database can be tested selectively to determine responsibility. This
offers the advantage that it is unnecessary to subject largely
similar network elements to continuous testing in order to form
opinions concerning services which are potentially faulty or at
risk. The effect of the present invention is therefore to
significantly reduce the total cost of monitoring network elements
reliably.

The present invention is explained in greater detail as an exemplary
embodiment with reference to the drawings, in which:

Figure 1 shows a schematic representation of a communications
network control and monitoring system,

Figure 2 shows a flow chart of a method for determining a cause of
a reduced performance of service in a communications
network.

The diagram of a communications network control and monitoring
system shown in Figure 1 comprises a service provision device 101, a
communication connection management device 102, a fault-monitoring
device 104 and a service quality monitoring device 105.

The service provision device 101 is intended for arranging or
changing services. For this purpose the service provision device 101
receives messages 121 with service requests and converts these into
messages 122 with connection requests which are transmitted to the
communication connection management device 102.

The communication connection management device 102 is intended for 
storing information that describes network elements, which are 
relevant to the provision of a service, with regard to their 
functional properties and their topological arrangement. This 
information is assigned to the appropriate service and stored in a 
network element database 103 associated with the communication 
connection management device 102. The network elements that are
relevant to the provision of a service include such items as network
access points, ports and line connections along an end-to-end
network path between two service access points. Functional
properties of network elements include bandwidth, the supported 
communications protocols and the switching technologies used. The 
description of the topological arrangement of network elements 
comprises sub-dividing the network elements into node network 
elements such as measurement points and switching points, and branch 
network elements such as line connections, in the form of 
topological information prepared according to a node-branch model. 
The information stored in the network element database 103 can be 
retrieved by the fault-monitoring device 104 and the service quality 
monitoring device 105. 
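
Purely as an illustration, the node-branch description held in the
network element database 103 might be sketched in Python as follows.
The class and field names (NetworkElement, NetworkElementDatabase,
assign, elements_for), the unit chosen for bandwidth and the sample
identifiers are assumptions made for this sketch and are not taken
from the embodiment.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class NetworkElement:
    """One network element; 'kind' follows the node-branch model."""
    element_id: str
    kind: str                # "node" (e.g. switching point) or "branch" (line connection)
    bandwidth_mbit: float    # functional property (unit is an assumption)
    protocols: tuple = ()    # supported communications protocols
    endpoints: tuple = ()    # for branches: ids of the two nodes they connect


@dataclass
class NetworkElementDatabase:
    """Hypothetical stand-in for the network element database 103."""
    elements: dict = field(default_factory=dict)     # element_id -> NetworkElement
    by_service: dict = field(default_factory=dict)   # service -> [element_id, ...]

    def assign(self, service: str, element: NetworkElement) -> None:
        # Store the description and assign it to the service.
        self.elements[element.element_id] = element
        self.by_service.setdefault(service, []).append(element.element_id)

    def elements_for(self, service: str) -> list:
        # Elements relevant to the provision of the given service.
        return [self.elements[eid] for eid in self.by_service.get(service, [])]


db = NetworkElementDatabase()
db.assign("business-internet", NetworkElement("access-1", "node", 100.0, ("IP",)))
db.assign("business-internet", NetworkElement("switch-7", "node", 1000.0, ("IP",)))
db.assign("business-internet",
          NetworkElement("line-1-7", "branch", 100.0, ("IP",), ("access-1", "switch-7")))
```

Keeping the assignment per service is what later allows testing to be
restricted to the elements of an affected service.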

When a service is being arranged or changed, the service provision
device 101 sends the fault-monitoring device 104 a message with a
request to monitor the availability of network elements that are
specified as relevant to the provision of the service concerned. In
the same manner a message 125 with a request to monitor the service
quality is transmitted to the service quality monitoring device 105.
These monitoring requests cause the fault-monitoring device 104 and
the service quality monitoring device 105 to compare fault messages
127 and measured values 128 with the information on impermissible
deviations stored in the network element database 103. The fault
messages 127 and measured values 128 are detected in sub-networks
106, 107, 108 and forwarded to the fault-monitoring device 104 or
the service quality monitoring device 105 via a network operating
system 109, 110, 111 assigned to the respective sub-network. For
this purpose corresponding network element database information is
queried by the fault-monitoring device 104 and/or the service
quality monitoring device 105 and sent to them as messages 126. In
the event of an impermissible deviation from the information stored
in the network element database 103, the fault-monitoring device 104
and/or the service quality monitoring device 105 generate a message
129, 130 about a reduced service performance, specifying the service
affected.
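
The comparison of a measured value 128 with the range stored in the
network element database 103 can be pictured roughly as below. This
is a minimal sketch under stated assumptions: the names
PermissibleRange and check_measurement, the parameter name and the
limit values are hypothetical and only illustrate the principle.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class PermissibleRange:
    """Permissible operating range for one parameter of a network element."""
    low: float
    high: float


def check_measurement(service: str, element_id: str, parameter: str,
                      value: float, limits: dict) -> Optional[dict]:
    """Return a reduced-performance message, or None if the value is permissible.

    'limits' maps (element_id, parameter) to a PermissibleRange and plays the
    role of the information queried from the network element database.
    """
    allowed = limits.get((element_id, parameter))
    if allowed is None:
        return None                  # element is not monitored for this parameter
    if allowed.low <= value <= allowed.high:
        return None                  # within the permissible range
    return {                         # message specifying the service affected
        "service": service,
        "element": element_id,
        "parameter": parameter,
        "value": value,
        "permissible": (allowed.low, allowed.high),
    }


limits = {("switch-7", "packet_loss_percent"): PermissibleRange(0.0, 0.5)}
ok = check_measurement("business-internet", "switch-7", "packet_loss_percent", 0.2, limits)
alarm = check_measurement("business-internet", "switch-7", "packet_loss_percent", 2.0, limits)
```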

Availability and service quality are monitored in accordance with a
service level agreement between a customer and a network operator.
Therefore the network element database 103 only stores information
on the network elements specified by a service level agreement as
relevant to the provision of a service. Furthermore only fault
messages and/or measured values about the network elements specified
by a service level agreement as relevant to the provision of the
service are detected. For the purpose of monitoring the rules
defined in the context of a service level agreement, a message 123
with a request to monitor a service level agreement is transmitted
by the service provision device 101 to the fault-monitoring device
104 when a service is being arranged or changed. This means that the
detection of fault messages and/or measured values about the network
elements specified by a service level agreement as relevant to the
provision of a service is initiated when the service concerned is
arranged or changed.

If the evaluation of a measured value 128 in the service quality
monitoring device 105 reveals that a network element is being
operated outside the permissible operating range, the service
quality monitoring device 105 sends an alarm message 129 about an
infringement of a service quality criterion to the fault-monitoring
device 104, where the message is converted into an alarm message 130
about an infringement of a service level agreement. A fault message
127 is converted in the fault-monitoring device 104 directly into an
alarm message about infringement of a service level agreement. The
alarm message 130 contains a statement about the service
availability and/or service quality and, for the purpose of
remedying the reduced service performance, is transmitted via a
network operating system 109, 110, 111 assigned to the sub-network
106, 107, 108 in which a fault or an infringement of the service
quality criterion is taking place. The appropriate network operating
system 109, 110, 111 converts the alarm message 130 into a control
command 131, which is transmitted as a message to a selected control
device (not shown) in the respective sub-network 106, 107, 108. To
remedy the reduced service performance, the network element affected
by the remedial action is reconfigured by the appropriate network
operating system 109, 110, 111, which accesses the information
stored in the network element database 103. This also applies to the
configuration of a network element when a service is being arranged,
changed or deleted.

The alarm message 130 about an infringement of a service level
agreement is also transmitted to the service provision device 101.
There it is processed with customer data and converted into a report
132 about compliance with or infringement of a service level
agreement.

The current view is that it is not realistic to solve every problem
that arises in a communications network as quickly as possible in
order to guarantee service quality and availability. Due to the
plurality of services offered to the customers of network operators
on the basis of numerous service level agreements, the starting
point for any approach to remedying network problems should always
be a customer-centered or service-oriented view of available network
resources or elements. This makes it necessary to test service
quality and availability as perceived by the customer. Important
criteria for such a test are service quality parameters such as data
transmission rate, latency and phase variations leading to data
loss. For this reason it is helpful to check which network problems
have an effect on which services, using this information to
prioritize the remedying of network problems, for example by
reference to service level agreements. Great significance is also
attached to early detection of existing network problems which may
quickly cause services to become faulty or place them at risk.

In order to cope with these requirements, the present invention
brings together approaches aimed at monitoring network elements or
network resources at the physical level on the one hand, and
monitoring service quality and availability at the application level
on the other. Approaches aimed at monitoring on the physical level
only quickly reach their limits, particularly in extended, complex
networks, since in such cases a plurality of network elements and
network resources have to be monitored. In particular there is a
problem that if a central network resource fails it triggers a
plurality of consecutive alarm messages. This means that in such
critical situations an enormous quantity of information has to be
processed under time-critical conditions. A first approach to
handling this problem consists in filtering out secondary alarm
messages. Whilst approaches aimed at monitoring purely at the
application level offer the advantage of a service-oriented or
customer-centered view together with pre-consolidation of
information regarding the evaluation of alarm messages, they have
the disadvantage that test results cannot be used to form opinions
about the causes of failures or problems. According to the
invention, both the approaches mentioned are combined by providing
information concerning the network and service topology.

The provision 201 of information concerning the network and service
topology is the starting point of the inventive method for
determining a cause of a reduced performance of service in a
communications network, the sequence of which is explained by means
of the flow chart shown in Figure 2. For this purpose network
elements which are relevant to the provision of a service are
described with regard to their functional properties and their
topological arrangement, the associated descriptive information
being stored in the network element database 103 and assigned to the
appropriate service. This is followed by an arrangement 202 for
testing provided services and an arrangement 203 to establish a time
plan for testing the selected services with respect to service
availability and/or service quality. Establishing the arrangement
for the services and time plan is followed at preset times and for
the selected services by the testing 204 of service availability
and/or service quality. This is followed by an evaluation 205 of
test results, said test results being used to determine services
which are faulty and/or at risk, and which are either immediately or
potentially affected by a reduced service performance. The
information concerning the network and service topology stored in
the network element database 103 is used to carry out a
determination 206 of the network elements to be tested. The network
elements determined as needing to be tested are those network
elements which are identified in the network element database 103 as
relevant to the provision of the service affected by the reduced
service performance. There then follows testing 207 of the network
elements determined as above, and lastly evaluation 208 of the test
results and determination of the network elements responsible for
the reduction in service performance.
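
Steps 204 to 208 can be summarized in a short Python sketch. The
function name determine_cause and the callables test_service and
test_element are placeholders introduced here; how the tests
themselves are carried out is left open, and the sketch is not meant
as a definitive implementation of the method.

```python
from typing import Callable, Dict, Iterable, List


def determine_cause(
    selected_services: Iterable[str],
    relevant_elements: Dict[str, List[str]],  # service -> element ids (database 103)
    test_service: Callable[[str], bool],      # True if the service passes its test
    test_element: Callable[[str], bool],      # True if the element passes its test
) -> Dict[str, List[str]]:
    """Map each affected service to the elements found responsible."""
    causes: Dict[str, List[str]] = {}
    for service in selected_services:
        # Steps 204/205: test the service and evaluate the result.
        if test_service(service):
            continue
        # Step 206: select only elements relevant to the affected service.
        candidates = relevant_elements.get(service, [])
        # Steps 207/208: test those elements and keep the ones that fail.
        causes[service] = [e for e in candidates if not test_element(e)]
    return causes


topology = {
    "e-mail": ["access-1", "mail-server"],
    "file-transfer": ["access-1", "file-server"],
}
tested = []  # records which elements were actually tested


def element_ok(element_id: str) -> bool:
    tested.append(element_id)
    return element_id != "mail-server"   # simulated fault in one element


result = determine_cause(
    ["e-mail", "file-transfer"],
    topology,
    test_service=lambda s: s != "e-mail",  # only "e-mail" fails its service test
    test_element=element_ok,
)
```

In this example only the two elements assigned to the failing service
are tested; the element "file-server" is never touched, which is the
saving the method aims at.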

Testing 204 of the selected services with respect to service
availability and/or service quality is carried out by means of a
comparison with the information stored in the network element
database 103. The information stored in the network element database
describes permissible operating ranges for the network elements.
This information is used for the testing 207 of the network elements
determined as above, in order to determine a cause for the reduced
service performance.

The network elements so determined are tested to determine the cause
of a reduced service performance by such means as detecting status
or fault messages for the network elements concerned. These messages
can be monitored by the fault-monitoring device 104 or the service
quality monitoring device 105 either actively or passively.
Furthermore such monitoring can also apply to data processing
devices (not shown in Figure 1) which are connected to the
communications network shown in Figure 1, as well as to operating
system programs or application programs running on these data
processing devices.

For the purpose of testing service availability and/or service
quality, the fault-monitoring device 104 and/or the service quality
monitoring device 105 provide control sequences containing preset
service access requests to be executed on or by selected service
access points in the sub-networks 106, 107, 108. This means that a
virtual user is set up at the service access point concerned, and
this user's behavior is mapped by the service access requests
contained in the control sequence. Such a virtual user can be
perceived both as a message source and as a message sink. Since a
virtual user is in the main fully controllable by the fault-
monitoring device 104 or the service quality monitoring device 105,
it can in principle be located on any service access point, for
example near the service access points of real users or on
interfaces between sub-networks. The control sequences containing
the preset service access requests can for example emulate the
following scenarios:

- "Internet access" - querying one or more reference Internet
  pages,
- "E-mail" - sending a reference e-mail to an e-mail server and
  querying the e-mail from the e-mail server,
- "File transfer" - querying a reference file from a file server,
- "Domain name access" - resolving logical Internet addresses into
  physical Internet addresses,
- "TCP (transmission control protocol)" - establishing a TCP
  connection,
- "DHCP (dynamic host configuration protocol)" - querying an
  Internet address,
- "UDP (user datagram protocol)" - sending a UDP data packet from a
  data source to a data sink.
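
A control sequence for a virtual user could be sketched as below.
This is only an illustration under stated assumptions: the function
names run_control_sequence, tcp_connect, resolve_name and unreachable
are chosen for this sketch, the host name in the commented usage is a
placeholder, and the demonstration at the end uses simulated requests
so that it runs without network access. Only standard Python socket
calls are used.

```python
import socket
import time
from typing import Callable, List, Tuple


def tcp_connect(host: str, port: int, timeout: float = 3.0) -> None:
    """'TCP' scenario: establish a TCP connection and close it again."""
    with socket.create_connection((host, port), timeout=timeout):
        pass


def resolve_name(host: str) -> None:
    """'Domain name access' scenario: resolve a name into addresses."""
    socket.getaddrinfo(host, None)


def run_control_sequence(requests: List[Tuple[str, Callable[[], None]]]) -> List[dict]:
    """Execute preset service access requests in order and record the outcome."""
    results = []
    for name, request in requests:
        start = time.monotonic()
        try:
            request()
            available = True
        except OSError:
            # Socket errors, timeouts and failed lookups are OSError subclasses.
            available = False
        results.append({
            "scenario": name,
            "available": available,
            "duration_s": time.monotonic() - start,
        })
    return results


# A real sequence might look like this (the host is a placeholder):
# run_control_sequence([
#     ("Domain name access", lambda: resolve_name("reference.example")),
#     ("TCP", lambda: tcp_connect("reference.example", 80)),
# ])

def unreachable() -> None:
    # Simulated failing request for the offline demonstration.
    raise ConnectionRefusedError("simulated failure")


demo = run_control_sequence([("TCP", lambda: None), ("E-mail", unreachable)])
```

Passing the requests in as callables keeps the sequence itself
independent of the scenario, so further scenarios from the list above
could be added without changing the runner.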

Moreover network elements that are essential to the provision of a
selected service can be tested at preset times for their ability to
operate, without the service first having to be tested with respect
to service quality and/or service availability. This offers the
advantage that particularly critical network resources, even those
without break points for reduced service performance, can be tested
directly. Direct testing of the ability to operate can also include
terminals for which testing at selected times has been defined by a
service level agreement between a customer and a network operator.
Thus testing of network elements or network resources can refer not
only to network operators' own network elements or network
resources, but also to customers' own terminals.
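
Testing at preset times can be pictured with a simple time plan. The
sketch below assumes, purely for illustration, that tests repeat at a
fixed interval; the names PlanEntry and due_targets and the sample
targets are hypothetical and not part of the embodiment.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PlanEntry:
    target: str       # service, network element or terminal to be tested
    interval_s: int   # assumed fixed repetition interval, must be > 0
    next_due_s: int   # next preset test time, in seconds on a common clock


def due_targets(plan: List[PlanEntry], now_s: int) -> List[str]:
    """Return targets whose preset time has been reached and reschedule them."""
    due = []
    for entry in plan:
        if entry.next_due_s <= now_s:
            due.append(entry.target)
            # Advance past 'now' so that a late call does not create a backlog.
            while entry.next_due_s <= now_s:
                entry.next_due_s += entry.interval_s
    return due


plan = [PlanEntry("core-router", 60, 0), PlanEntry("customer-terminal", 3600, 1800)]
first = due_targets(plan, now_s=0)
second = due_targets(plan, now_s=1800)
```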

The method described above for detecting reduced service performance
can also determine faults in network components which from the
network operator's point of view do not belong to that operator's
network but rather to a third-party network and are not directly
controllable from the network operator's point of view.

The steps to which the method relates, which are carried out in the
service provision device 101, the communication connection
management device 102, the fault-monitoring device 104 and the
service quality monitoring device 105, are implemented in each case
by a control program provided for the service provision device 101,
the communication connection management device 102, the fault-
monitoring device 104 and the service quality monitoring device 105.
The respective control program runs on a data processing device
assigned to the service provision device 101, the communication
connection management device 102, the fault-monitoring device 104
and/or the service quality monitoring device 105. Depending on the
application, it is also possible to use a common data processing
device on which the said control programs run either separately or
as combined control programs.

The application of the present invention is not confined to the
typical embodiment described here.

Claims

1. Method for determining a cause of a reduced performance of
service in a communications network, in which

- information that describes the functional properties and
  topological arrangement of network elements which are relevant to
  the provision of a service is stored in a network element
  database and assigned to the service,
- service availability and/or service quality for a selected
  service are tested at preset times,
- a service affected by a reduced performance of service is
  determined based on a result of this testing,
- network elements relevant to the provision of the service, which
  is affected by the reduced performance of service, are determined
  based on the information stored in the network element database,
- the determined network elements are tested in order to determine
  a cause of the reduced performance of service.

2. Method according to Claim 1,

in which the testing for service availability and/or service quality
is carried out by means of a comparison with the information stored
in the network element database.

3. Method according to Claims 1 or 2,

in which the information stored in the network element database
describes permissible operating ranges for the network elements and
the determined network elements are tested in order to determine a
cause of the reduced performance of service.

4. Method according to Claims 1 to 3,

in which the determined network elements are tested to determine a
cause of a reduced service performance by detecting status and/or
fault messages for the said network elements.

5. Method according to Claims 1 to 4,

in which for the purpose of testing service availability and/or
service quality, control sequences containing preset service access
requests to be executed on or by selected service access points are
provided.

6. Method according to Claims 1 to 5,

in which network elements determined as essential for the provision
of a selected service are tested at preset times for their ability
to operate.

7. Method according to Claims 1 to 6, 

in which terminals specified by a service level agreement are tested 
at selected times for their ability to operate. 

8. Communications network control and monitoring system with

- a communication connection management device for storing
  information that describes the functional properties and
  topological arrangement of network elements which are relevant to
  the provision of a service in a network element database and for
  assigning this information to the service,
- a service quality and/or fault-monitoring device for causing a
  test of service availability and/or service quality for a
  selected service at preset times, for determining a service
  affected by a reduced service performance with the aid of a
  result of the test, for determining network elements relevant to
  the provision of the service affected by the reduced service
  performance with the aid of the information stored in the network
  element database and for causing a test of the determined network
  elements in order to determine a cause of the reduced service
  performance.

9. Control program for a service quality and/or fault-monitoring
device of a communications network control and monitoring system
according to Claim 8, which can be loaded into a random access
memory of a data processing device assigned to the service quality
and/or fault-monitoring device and has at least one code section,
during the execution of which

- service availability and/or service quality for a selected
  service are tested at preset times,
- a service affected by a reduced performance of service is
  determined based on a result of this testing,
- network elements relevant to the provision of the service, which
  is affected by the reduced performance of service, are
  determined based on the information stored in the network
  element database,
- the determined network elements are tested in order to determine
  a cause of the reduced performance of service,

when the computer program is running in the data processing device.